69 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Bosnian Croatian Indonesian Malay Serbian
Availability:
Freely Available
License:
<Not Specified>
Size:
29.1 MByte Production Status:
Existing-updated
Use:
Discriminating Similar Language
-
Paper title:Merging Comparable Data Sources for the Discrimination of Similar Languages: The DSL Corpus Collection
-
Paper track:<Not Specified>
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marcos Zampieri | Saarland University | DE |
| Author 2 | Nikola Ljubesic | University of Zagreb | HR |
| Author 3 | Jorg Tiedemann | University of Uppsala | SE |
| Main Contact | Liling Tan | Rakuten Institute of Technology | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian
Availability:
Freely Available
License:
OpenSource
Size:
404 KByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Evaluation of Croatian Word Embeddings
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Lukas Svoboda | University of West Bohemia | CZ |
| Author 2 | Slobodan Beliga | Department of Informatics, University of Rijeka | HR |
| Main Contact | Lukas Svoboda | University of West Bohemia | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian English
Availability:
Freely Available
License:
<Not Specified>
Size:
87024 entries Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Comparing two acquisition systems for automatically building an English–Croatian parallel corpus from multilingual websites
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Miquel Esplà-Gomis | Universitat d'Alacant | ES |
| Author 2 | Filip Klubička | University of Zagreb | HR |
| Author 3 | Nikola Ljubešić | University of Zagreb | SI |
| Author 4 | Sergio Ortiz-Rojas | Prompsit Language Engenering | ES |
| Author 5 | Vassilis Papavassiliou | Institute for Language and Speech Processing / RC Athens | GR |
| Author 6 | Prokopis Prokopidis | Institute for Language and Speech Processing/Athena RC | GR |
| Main Contact | Miquel Esplà-Gomis | Universitat d'Alacant | None |
Documentation:
Documentation in English is publicly available at http://redmine.abumatran.eu/projects/en-hr-tourism-corpus/documents
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian
Availability:
Freely Available
License:
CC BY-SA 3.0
Size:
4000 Production Status:
Newly created-in progress
Use:
<Not Specified>
-
Paper title:The SETimes.HR Linguistically Annotated Corpus of Croatian
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Željko Agić | IT University of Copenhagen | DK | University of Zagreb | DK |
| Author 2 | Nikola Ljubešić | University of Zagreb | SI | ||
| Main Contact | Željko Agić | IT University of Copenhagen | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Croatian English
Availability:
From Data Center(s)
License:
IPR of the AV-resources is with the owner: Documenta. The IPR of the metadata is opensource via DANS
Size:
600 Production Status:
Newly created-in progress
Use:
Oral History
-
Paper title:Croatian Memories
-
Paper track:Speech
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Arjan van Hessen | University of Twente | NL |
| Author 2 | Franciska de Jong | University of Twente | NL |
| Author 3 | Stef Scagliola | Erasmus University Rotterdam | NL |
| Author 4 | Tanja Petrovic | Documenta | HR |
| Main Contact | Arjan van Hessen | University of Twente | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Croatian
Availability:
From Owner
License:
<Not Specified>
Size:
4626 Production Status:
Newly created-finished
Use:
<Not Specified>
-
Paper title:Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Željko Agić | IT University of Copenhagen | DK | University of Zagreb | DK |
| Author 2 | Daša Berović | University of Zagreb | HR | ||
| Author 3 | Danijela Merkler | University of Zagreb | HR | ||
| Author 4 | Marko Tadić | University of Zagreb, Faculty of Humanities and Social Sciences | HR | ||
| Main Contact | Željko Agić | IT University of Copenhagen | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Trilingual
Languages:
Bosnian Bulgarian Croatian
Availability:
Freely Available
License:
CC0 1.0 Universal
Size:
89 MByte Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Using Adversarial Examples in Natural Language Processing
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Petr Bělohlávek | Charles University | CZ |
| Author 2 | Ondřej Plátek | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 3 | Zdeněk Žabokrtský | Charles University in Prague | None |
| Author 4 | Milan Straka | Charles University | None |
| Main Contact | Petr Bělohlávek | Charles University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian English
Availability:
Freely Available
License:
<Not Specified>
Size:
55083246 words Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Nikola Ljubešić | University of Zagreb | HR | ||
| Author 2 | Miquel Esplà-Gomis | Universitat d'Alacant | ES | ||
| Author 3 | Antonio Toral | Dublin City Unversity | IE | ||
| Author 4 | Sergio Ortiz Rojas | <Not Specified> | None | ||
| Author 5 | Filip Klubička | University of Zagreb | HR | ||
| Main Contact | Nikola Ljubešić | Jožef Stefan Institute | None | University of Zagreb | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
Croatian
Availability:
Freely Available
License:
CreativeCommons
Size:
10000 words Production Status:
Newly created-in progress
Use:
Word Sense Disambiguation
-
Paper title:Graph-Based Induction of Word Senses in Croatian
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marko Bekavac | University of Zagreb, Faculty of Electrical Engineering and Computing | HR |
| Author 2 | Jan Šnajder | University of Zagreb, Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb | HR |
| Main Contact | Jan Šnajder | University of Zagreb, Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Trilingual
Languages:
Croatian Serbian Slovenian
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Diacritics Restoration Using Neural Networks
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jakub Náplava | Charles University, Institute of Formal and Applied Linguistics | CZ | ||
| Author 2 | Milan Straka | Charles University | None | ||
| Author 3 | Pavel Straňák | Charles University in Prague | CZ | ||
| Author 4 | Jan Hajic | Charles University in Prague | CZ | Charles University | CZ |
| Main Contact | Jakub Náplava | Charles University, Institute of Formal and Applied Linguistics | None |
Documentation:
<Not Specified>




